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Abstract. For many diseases, logistic and other constraints often render large in- 
cidence studies difficult, if not impossible, to carry out. This becomes a drawback, 
particularly when a new incidence study is needed each time the disease incidence 
rate is investigated in a different population. However, by carrying out a prevalent 
cohort study with follow-up it is possible to estimate the incidence rate if it is con- 
stant. In this paper we derive the maximum likelihood estimator (MLE) of the overall 
incidence rate, A, as well as age-specific incidence rates, by exploiting the well known 
epidemiologic relationship, prevalence = incidence x mean duration (P = A x (i). We 
establish the asymptotic distributions of the MLEs, provide approximate confidence 
intervals for the parameters, and point out that the MLE of A is asymptotically most 
efficient. Moreover, the MLE of A is the natural estimator obtained by substituting the 
marginal maximum likelihood estimators for P and /i, respectively, in the expression 
P = A x (j,. Our work is related to that of Kciding (1991, 2006), who, using a Markov 
process model, proposed estimators for the incidence rate from a prevalent cohort study 
without follow-up, under three different scenarios. However, each scenario requires as- 
sumptions that are both disease specific and depend on the availability of epidemiologic 
data at the population level. With follow-up, we are able to remove these restrictions, 
and our results apply in a wide range of circumstances. We apply our methods to data 
collected as part of the Canadian Study of Health and Ageing to estimate the incidence 
rate of dementia amongst elderly Canadians. 
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1. Introduction 

In an incidence study, whose goal is to estimate a disease incidence rate, a cohort 
of initially disease-free subjects is followed forward in time. The subjects are monitored 
closely and for those who develop the disease their approximate times of disease onset 
are recorded. Often, as part of an incidence study, these diseased subjects are followed 
until "failure" or censoring. The data collected from such an incidence study may then 
be used to directly estimate both the disease incidence rate and the survival function 
for the time from onset to failure. The estimators of the incidence rate and the survival 
function from such data are standard. 

For many diseases, however, logistic and other constraints often render large in- 
cidence studies difficult, if not impossible, to carry out. This becomes a drawback, 
particularly when a new incidence study is needed each time the disease incidence rate 
is investigated in a different population. Nevertheless, by carrying out a prevalent co- 
hort study with follow-up it is possible to estimate the incidence rate if it is constant, 
thus avoiding the problems associated with incidence studies. In this paper we derive 
the maximum likelihood estimator (MLE) of the overall incidence rate, A, as well as 
age-specific incidence rates from data collected as part of a prevalent cohort study with 
follow-up. We exploit the well known epidemiologic relationship, prevalence = incidence 
x mean duration (P = A x /i), to suggest that the likelihood be derived as a function of 
the vector (P, /i). Once the MLE, (P, fi) of (P, /x), is obtained, the MLE of A = — follows 
by invariance. A similar approach may be used to find the MLEs of age specific incidence 
rates. The asymptotic distributional properties of the estimators may be obtained by 
modifying previous results for the MLE of the survival function, based on survival data 
from a prevalent cohort study with follow-up (see Section 4). It is comforting that the 
MLE A = is, therefore, also the natural ad hoc estimator of A. 

In a medical setting, a prevalent cohort study with follow-up (Wang 1991) begins 
with the identification, from a sampled cohort, of those with existing (prevalent) disease. 
The dates of onset for the diseased are ascertained and the diseased subjects are followed 
forward in time until failure or censoring. Other data collected include the ages at the 
time of recruitment, the failure/censoring times of the subjects who are followed, and 
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covariates of interest to the researchers. There are two main features of the data collected 
from such studies. First, the dates of disease onset of the prevalent cases do not include 
the dates of onset of those who died prior to the start of the prevalent cohort study; 
we can only speculate as to the existence of such subjects. Hence, direct use of the 
observed dates of onset from a prevalence study, in contrast to dates of onset from an 
incidence study, leads to underestimation of the true incidence rate. Second, the observed 
failure/censoring intervals are left-truncated and, if the underlying incidence process is 
stationary, as is the assumption here, they are length biased; those with longer survival 
intervals are more likely to be observed (Wicksell 1925, Neyman 1955, Cox 1969, Patil 
and Rao 1978, and Vardi 1982, 1985). We address these difficulties in deriving the MLE 
(P, fx) and hence the MLE A, of the overall incidence rate. We use a similar approach 
to the estimation of age-specific incidence rates. 

Keiding (1991, 2006) used a Markov process model to derive carefully, the prevalence- 
incidence relationship, and proposed three different scenarios which facilitate estimation 
of the (constant) age-specific incidence rate when there are no follow-up data. In the 
first scenario it is assumed that there is non-differential mortality for the diseased and 
non-diseased. In Biering-Sorensen and Hilden (1984) this assumption is likely to be ten- 
able while in Keiding et al. (1989), it is probably not, since the disease under study is 
diabetes. In the second scenario, which Keiding invokes in his 1989 paper, no assump- 
tion of non-differential mortality is made. It is either assumed that the incidence rate 
is small and that the difference between the intensities from the healthy and diseased 
states to death is known or that the difference between these two intensities is small 
and known. In the third scenario, it is assumed that the joint relative intensity of the 
calendar time, age- and duration-specific mortality is known. Under each scenario a 
parametric assumption must be made, and in the last two scenarios certain population 
parameters must be known. Therefore, these estimators are strongly disease-specific and 
also dependent on the availability of certain population level data. These assumptions 
are needed to compensate for not having follow-up information. By following-up the 
prevalent cases we are able to avoid these assumptions. Our main assumption is that 
the underlying incidence process is a stationary Poisson process, an assumption that 
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Keiding also makes. Stationarity of the incidence rate holds, roughly, for many diseases: 
for example, amyotrophic lateral sclerosis (Sorenson et al. 2002), certain types of cancers 
(Jemal et al. 2005), and schizophrenia (Folnegovic and Folnegovic-Smalc 1992). It may 
not, however, be tenable for an infectious disease. 

Diamond and McDonald (1991) also considered incidence rate estimation from 
prevalent- case data, with no follow-up, again under different assumptions. Ogata et 
al. (2000) took an empirical Bayes approach to the analysis of retrospective incidence 
data. More recently Alioum et al. (2005) make HIV-AIDS-specific model assumptions 
to estimate a general incidence rate. To our knowledge there is no literature that pro- 
vides a general framework for maximum likelihood estimation of a constant underlying 
incidence rate when one has access only to prevalent cohort survival data with follow-up. 

The rest of this paper is organized as follows: In Section [2] we provide a careful 
formulation of a prevalent cohort study with follow-up, paying particular attention to 
survival data. In Section [3] we discuss the MLE for the underlying incidence rate. In 
Section HJ we present the asymptotic properties of the estimator, paving the way for 
computation of an approximate confidence interval for the underlying incidence rate. In 
Section [5] we extend our results to include age-specific incidence rates. In Section [6] we 
apply our methods to data collected as part of the Canadian Study of Health and Aging 
(CSHA), in order to estimate the underlying age-specific incidence rates of dementia 
amongst the elderly in Canada. 

2. General setup and notation 

Let Xi,X2, ...,X m be m i.i.d. positive random variables representing the survival 
times of individuals from onset of a disease, say, to an end point of interest. Let the Aj's 
have survivor function S(x) = P(Xi > x), cumulative distribution function F(x), and 
probability density function f(x). Define \i to be the mean survival time; that is, \i = 
/ °° S(x)dx. Suppose that Ti, t 2 , r m are the m calendar times of onset corresponding 
to Xi, X 2 , ...,X m and let r* be the calendar time of recruitment into a study. Individual 
% is observed in the study only if Xi > t* — Tj and, therefore, for i — 1, 2, m, Xi is left 
truncated with left truncation time Tj = r* — Tj. Since the onset times are random, the 
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truncation times are random variables, with distribution function denoted by G, and 
density g. Let Yi,Yi, ...,Y n be the observed left truncated lifetimes, with n < m. That 
is, P(Yi > x) = P{Xi > x\Xi > Ti). We borrow terminology from renewal theory and 
write Yi = Y bwd + Y/ wd , where Y bwd is the time from onset to recruitment into the study 
or the "backward recurrence time", and Y/ wd is the time from recruitment to failure, 
or the "forward recurrence time". Also, let Flb represent the distribution of the YiS, 
where the subscript, LB, is used to indicate that the Y^'s are length-biased. 

Suppose that individual % has censoring time C* = Y bwd + Cj, where Cj, which we 
call the "residual censoring time", is the time from recruitment until the individual is 
censored. We assume that P(C* > Tj) = 1 (see Wang 1991) and we thus observe only 
min(C*,Yj). Often, however, the backward recurrence times are fully observed, and 
we assume this to be the case here, so that the observed data are: (Y i , Y° bs , Si) i = 
1,2, ...,n where Y° bs = min(Y/ wd ,Ci) and Si = I[Y/ wd < Ci] indicates whether subject 
% has been followed until failure. Since C* and Yi share Y bwd , the full survival times, Yi, 
are informatively censored (Vardi 1989). It is still reasonable in many cases, however, 
to assume that Ci is independent of both Y/ wd and Y bwd , since independence between 
Ci and Y i d corresponds to the usual random censoring assumption. 

In summary, we differentiate between the potential failure times, X{, some of which 
will not be observed because of left truncation, and the observed failure/censoring times 
(Y bwd + Y° bs ); these are event times of the "long survivors". 

3. Point estimation of the incidence rate 

Under stationarity we assume the underlying incidence process is a Poisson process 
with constant intensity \(t) = A. Hence, the truncation time distribution, G, is uniform, 
conditional on the number of incident times in (0, r*) (Asgharian, Wolfson, and Zhang 
2006). 

We begin by deriving the MLE of A. We then derive the MLE of the age-specific 
incidence rates. The approach depends on the well-known relationship, P — A x \i 
where P is the time-independent point prevalence, A is the time-independent underlying 
incidence rate, and /x is the mean duration of the disease (see Keiding 1991). 
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Let a random number of iV prevalent cases be observed from a large group of s in- 
dividuals selected from screening. Fixing N at n, the realized number of prevalent cases, 
Asgharian, M'Lan, and Wolfson (2002) and Asgharian and Wolfson (2005) derived the 
unconditional NPMLE, S, of S under the assumption of stationarity. They established 
its asymptotic properties and those of the NPMLE, ft = J °° S(x)dx. Conditioning on 
N = n, the likelihood of the data is 

(1) L = L(S, P) = /(data, n; S, P). 

Write L(S, P) as L(S\n)L(P) = /(data; S\n)f(n; P), by sufficiency of N for P. Now, in 
practice, the data of a prevalent cohort study with follow-up are collected in two stages. 
In stage 1, a binary, 0-1, random variable, say £, is measured on each randomly selected 
subject to ascertain if the subject has experienced initiation of the disease. In stage 
2, we observe the triple (Y*' wd , Y° bs , Si) i — 1,2, ...,n on diseased subjects, indicated by 
£ = 1. The following tree diagram depicts our sampling scheme: 




Figure 1. Sampling scheme 



The full likelihood is: 

s 

(2) ^=n( i - p ) 1 ' 



i=l 



P 



( dF(y^ d + yl wd ) 



Si 



dF(uo) 



l-Si 



where P = P(£ = 1) is the time-independent point prevalence in the population. For 
the derivation of a similar likelihood see Asgharian and Wolfson (2005). As is readily 
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seen, the above likelihood can be factorized as 



(3) L 



i=l 



n 
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Joint maximization of ([2]) with respect to S and P gives the NPMLE (S, P) and hence 
the NPMLE (/a, P) where P = N/ s is the usual point prevalence estimator of P. It 
follows, by invariance, that A = P / fx is the unconditional NPMLE of A. It is seen that 
A, the MLE, is also the natural ad hoc estimator derived from the relation A = Pj /i, by 
replacing P and \x by their respective natural estimators. 

Wang (1991) derived the NPMLE G of G, the truncating distribution, by condi- 
tioning on the observed backward recurrence times. Although, under stationarity, G is 
uniform, and this observation allows one to informally assess stationarity, it does not 
lead to an estimate of A, since A is not uniquely determined by G. 

4. Interval estimation of the incidence rate 

To derive an asymptotic confidence interval for A we begin with the asymptotic 
properties of (A, P) which in turn requires a careful examination of the likelihood (j2j). 
Identity (HI) of Lemma 1, though simple, plays a key role in the derivation of an asymp- 
totic confidence interval for A as it facilitates the transferral of the asymptotic properties 
of fi and P to those of A. 

Lemma 1. Let A, P, and [i be respectively, the time-independent underlying incidence 
rate, the time-independent point prevalence and the mean duration of the disease. Let /t 
and P be the unconditional MLEs of p, and P respectively. Define A = j, the MLE of 
A. Then 

(4) X-X = ^[fi(P-P)-P((l- f x)} . 

Proof. The result follows immediately from the definitions of A and A. □ 

Theorem [1] below, which draws on Lemma dj essentially shows that A is consistent 
and asymptotically Normal. We state this result and provide the main steps of the proof 
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in the Appendix. Suppose 7 = sup{t : FiB(t) = 0}, and r = inf{t : FiB(t) = 1}. Then 
under mild conditions we have, 

Theorem 1. Suppose 7 > 0, r < 00, and ji = J °° xdF(x) < 00. Then as s — > 00 we 
have 

(a) I A — A I =■ o(yJ**^ 

(b) T = V ^(A-A)^^f^, 

where Z\ ~ N(0, A/i[l — A/i]) and Z 2 ~ iV(0, cr^/A/x) are independent, 

/* OO /* OO 1 1 

and u ) ^ s ^ e covariance function of the limiting process of S. 

The covariance function ip(u,v) has an intractable form (see Asgharian et al. 2002 and 
Asgharian and Wolfson 2005). The asymptotic variance of T has consequently a rather 
complex form which precludes the possibility of its direct estimation. Instead, we obtain 
a confidence interval for A by bootstrapping A. 

5. Estimating the age-specific incidence rate 

For many diseases the incidence rate is age- dependent, and estimators of age-specific 
incidence rates are almost always sought by epidemiologists. Following the notation from 
Section[2l let r* represent the calendar time of recruitment, let X be the time from onset 
to death, and r D be the calendar time of onset. Let D t be the event of being diseased 
and alive at time t, let A a be the age at onset, and A t the age at calendar time t. We 
assume that the distribution of X does not change with calendar time, and that both A Q 
and At are discrete random variables; the latter assumption can be relaxed to include 
arbitrary random variables. Then, 

P(D T . \A = z) = [ P(X > T *-t,T = t\A = z)dt 
Jo 

P(X>r*-t\r = t, A = z)dP TolAo (t | z) ■ 
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On the other hand, we have 

P(r G (t,t + dt),A = z) 



dPr \A (t I z) 



P(A = z) 

P(r Q e (t,t + dt),A t = z) 
P(A = z) 

P(A t = z) 



= dp r a \A t (t\z)x p{Ao = z) 
We also note that 

P(X > t* - t | t = t, A = z) = P(X > r* — t \ A a — z) — S z (r* — t) 
Having assumed that dP To \A t (t \ z)/dt = X z only depends on z, we obtain 

A 2 



P(D T * \A Q = z) 



S z (r*-t)P(A t = z)dt 



P{A Q = z) 



We thus find the age-specific incidence 

(5) \ z P ( D -r*' A c 



fZ S z (r* - t)P(A t = z)dt 
It follows by invariance that the MLE of A z is 

(6) *,= J^^H , 

/„ 5»(t« - t)P(A t = z)dt 

where P(D T *, A = z) is the observed proportion in the recruited cohort who are diseased 
and with age-at-onset z. Note that to find S z we begin by restricting our attention to 
the length-biased survival/ censoring times of the prevalent cases, whose onset occurred 
at age z. Then S z is the MLE of S z , based on these length-biased data, as derived by 
Asgharian et al. (2002) and Asgharian and Wolfson (2005). 

It is assumed that the population age distribution {P(A t = z)} z , may be routinely 
obtained from census data. Since census data are usually only updated every five years, 
a reasonable assumption is that P(A t = z) is piecewise constant as a function of t. 
However, as we shall see in Section [6] it might be possible to make the even stronger 
assumption that P(A t = z) = P(A — z), is roughly independent of t, without affecting 
X z substantially. An alternative which requires more intensive modeling, is to replace 
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the step function P(A t = z) by a smooth function of t. We suggest that the extra effort 
would probably result in very small improvement if any. 

Since, in Section El the population age distribution is assumed to be constant we 
restrict our attention to this case. Then equation §5§ reduces to 

(V\ \ - P (Dr*,A = z) 

(7) K - H z P(A = z) ' 

where P(A = z) is the proportion of subjects in age category z, and fi z represents the 
mean survival time in age category z. The information contained in the observations, 
for the case of three age categories [z = 1, 2, 3), may be illustrated through the following 
tree diagram, 




Figure 2. Illustration of information contained in the observations 



where 

\ if A a = z for the i-th subject, 
0, Otherwise. 

The full likelihood, for the general case z = 1,2,...,/ is, 



i=l 



n 

i=l 



dF(y^ d + y{ wd ^ 



dF(u)\i-8i 



\[P{A C 



Z=l 
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Using equation ((7j), 

(9) A - P ^^o = z ) 

is the MLE of A 2 , where /i 2 is the MLE of fi z derived from S z . 



6. Estimating the incidence rate of dementia 

In 1991, 10,263 elderly Canadians (65 years or older), living at home or in an 
institution, were screened for dementia (CSHA working group 1994). This phase of 
the study was known as CSHA-1. At the time of CSHA-1, 821 subjects were classified 
as having either possible Alzheimer's disease, probable Alzheimer's disease, or vascular 
dementia. Henceforth, by the term dementia we mean having exactly one of these 
three conditions since they constitute the vast majority of dementias. The approximate 
dates of onset were derived in a hierarchical fashion from the answers to three questions 
(Wolfson et al. 2001). In 1996, the second phase of the study, CSHA-2, was completed. 
CSHA-2 included the ascertainment of the date of death or right censoring for those 
cases identified at CSHA-1. These are the data upon which we shall base our estimates 
of the overall and age-specific incident rates of dementia. However, additional data 
were, in fact, collected as part of the CSHA with the goal of estimating the age-specific 
incidence rates of dementia among elderly Canadians. The subjects who were deemed 
not to have dementia at CSHA-1 were re-evaluated for dementia at CSHA-2. Assuming 
that these incidence rates had remained constant, they were estimated using the incident 
cases observed between CSHA-1 and CSHA-2. There were nevertheless, difficulties with 
these "incident" data since it could not be ascertained with certainty whether those 
who had died between CSHA-1 and CSHA-2 had become incident cases with dementia. 
In this paper we, therefore, re-estimated the incidence rates without relying on the 
"incident" cases that occurred between CSHA-1 and CSHA-2. The assumption of a 
roughly constant incidence rate for dementia has been previously checked in several 
ways and has been deemed to be reasonable (see Asgharian et al. 2002, Asgharian et al. 
2006 and Addona and Wolfson 2006). 
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Age Group 


Range 


SD 


cv 


65-74 


59.8 - 62.7% 


1.36% 


0.022 


75-84 


29.1 - 31.3% 


1.03% 


0.034 


85+ 


8.2 - 8.9% 


0.34% 


0.040 



Table 1. Range, standard deviation, and coefficient of variation for the 
percentage of Canadians (aged 65 and older) in each age group 

6.1. Estimating the overall incidence rate of dementia. The NPMLE, S(x), yields 
fx « 4.75 years or 57 months. Since the CSHA data did not constitute a random sample 
of all subjects over the age of 65 in Canada, we used the age-standardized prevalence 
estimate instead of simply, P — — . This gives an estimate for P of 0.066 (CSHA working 
group 1994), which leads to a point estimate, A = 0.0139, or 13.9 per 1,000 person-years. 
To obtain an interval estimate for A, we followed the bootstrap procedure and sampled 
with replacement from the 10,263 screened subjects to obtain 10,000 bootstrap samples 
of the same size. We obtained a confidence interval for A of [12.52, 15.28] cases per 1,000 
person-years. 

6.2. Estimating the age-specific incidence rate of dementia. Three age groups 
were considered for the CSHA data: 65-74, 75-84, and 85+ years old. The 821 cases of 
dementia were subdivided as follows: 164 had onset between 65 and 74 years old, 381 
had onset between 75 and 84 years old, and 276 were 85 or older when they had onset. 
The estimated mean survival times in years were 7.97, 5.16, and 3.50, for the 65-74, 
75-84, and 85+ groups, respectively. To use equation Qj, we require an approximately 
stable age distribution over the period covering the onset times. We consulted data from 
four Canadian censuses covering 1976-1991 to assess this assumption. Figure [3] shows 
the progression of the percentage of the Canadian population aged 65 and older in each 
of the three age groups (Statistics Canada 2006). Using data from the 1976, 1981, 1986, 
and 1991 censuses, we also computed some measures of variability for the percentage in 
each of the three age groups. These are presented in Table 1. 
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Figure 3. Percentage of Canadian population aged 65 and older in 65-74 
(circles), 75-84 (diamonds), and 85+ (triangles) age groups 



Having verified that the age distribution is roughly stable for this time period, we pro- 
ceeded with the age-specific incidence estimation using the 1991 census data. Amongst 
those 65 years or older in 1991, 59.8% were in the 65-74 group, 31.3% were in the 75-84 
group, and 8.9% were in the 85+ group (Statistics Canada 2006). The resulting age- 
specific incidence rate estimates are presented in Table 2. In 1976, amongst those 65 
years or older, 62.7% were in the 65-74 group, 29.1% were in the 75-84 group, and 8.2% 
were in the 85+ group (Statistics Canada 2006). We also estimated the age-specific 
incidence rates based on the census age distribution data from 1976 to take into account 
small changes in the population age distribution that might have occurred over the pe- 
riod from 1976 to 1991. When the age distribution changes, this approach provides a 
simple framework for investigating robustness of the age-specific incidence rate estimator 
to departures from the assumption of constancy of the age distribution. For comparative 
purposes, the age-specific incidence rate estimates based on the 1976 census data are 
also given in Table 2. 
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Age Group 


A 2 (1991) 


95% CI (1991) 


A 2 (1976) 


95% CI (1976) 


65-74 


3.35 


[2.72 , 3.99] 


3.20 


[2.58 , 3.82] 


75-84 


22.99 


[19.92 , 26.04] 


24.69 


[21.26 , 28.13] 


85+ 


85.86 


[70.52 , 101.20] 


93.39 


[77.00 , 109.77] 



Table 2. Age-specific incidence rate estimates per 1,000 person- years 
using 1991 and 1976 Canadian census data 



6.3. Discussion of dementia incidence rate estimates. Subjects were not prospec- 
tively monitored between CSHA-1 and CSHA-2. It was thus difficult to ascertain 
whether those who had died in this time period had had onset of Alzheimer's disease 
(possible or probable). As a result, incidence rates of Alzheimer's disease reported from 
the CSHA were underestimates of the true incidence rates amongst elderly Canadians 
since they were based only on subjects who survived until the end of CSHA-2 in 1996. 
The CSHA incidence rate estimates of Alzheimer's disease were 7.4 and 5.9 per 1,000 
person-years for women and men respectively (CSHA working group 2000), giving a 
crude estimated incidence rate for men and women combined of 6.7 per 1000 person 
years. Using the CSHA data, Hebert et al.(2000) estimated the incidence rate of vas- 
cular dementia to be 3.79 per 1,000 person-years. Therefore the CSHA overall (under-) 
estimated rate for dementia was approximately 6.7 + 3.79 = 10.49 per 1000 person 
years. Direct comparison with our results is difficult. Our overall estimate (for possi- 
ble or probable Alzheimer's, or vascular dementia) of 13.9 per 1,000 person-years seems 
to be consistent with these previous estimates obtained from the CSHA. An analogous 
comparison of the age-specific incidence estimates reveals that they too are consistent 
with those already obtained from the CSHA. Note that the slightly different point esti- 
mates for the age-specific incidence rates, particularly in the 85+ category, should not 
be interpreted as meaning that incidence rates declined or increased between 1976 and 
1991. They simply provide a range of possible values for the estimate depending on 
what is taken as the population age distribution. 
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7. Concluding remarks 

Simulations, whose results are not reported in this paper, suggest that our meth- 
ods work well for moderate sample sizes; the asymptotic distribution of the estimated 
incidence rates were close to Normal, the point estimates were close to their true values 
and the confidence intervals reasonably narrow for a range of parameter choices. 

Our estimator of the incidence rate depends on the estimator S, for the survival 
function S. We propose that the most efficient estimator for S, under the assumption of 
a constant incidence rate, should be used. The estimator, S, used in this paper is more 
efficient than the well-known estimator of S for general left truncation data (Wang 1991) 
which does not invoke stationarity of the incidence process (Asgharian et al. 2002). 
Indeed, it is possible to show that the estimators we present for the incidence rates 
(overall and age-specific) are asymptotically most efficient. This follows from Asgharian 
and Wolfson (2005, Theorem 3) and Van der Vaart (1998, Theorem 25.47, page 387). 

If the largest observed failure time is censored, S is left undefined beyond this point 
by most authors. Consequently fx is not well-defined in this situation. Fortunately, in 
our example, the largest survival time is a true failure time. Ad hoc "fixes" are available, 
but produce biased estimators. 

The CSHA data used for illustration is based on an initial cohort of 10,263 subjects 
obtained as a stratified cluster sample whereby a fixed number of institutionalized (about 
10%) and non-institutionalized (about 90%) subjects were sampled. In addition, those 
over 85 years old were over-sampled. We do not take into account the sampling scheme 
in our estimated incidence rates or in the asymptotic distributions of our estimators. To 
do so requires development of new theory allowing for within cluster dependence, which 
is a topic for further study and is not directly pertinent to our methods. 
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Appendix 

We provide a road map of the proof of Theorem [1] and give further details about 
steps (iv) and (v) below. Road map of the proof: 

(i) Establish the asymptotic behavior of Flb- 

(ii) Establish the asymptotic behavior of F using (i). 

(iii) Establish the asymptotic behavior of fx using (ii). 

(iv) Establish the independence of fx and P. 

(v) Establish the asymptotic behavior of A using (T4J), (iii), and (iv). 

The derivation of (i) is similar to its counterpart given by Asgharian and Wolfson (2005), 
and those of (ii) and (iii) are similar to their counterparts in Asgharian et al. (2002). 
The expressions, however, are slightly different in view of the different sampling scheme 
under consideration here. We therefore sketch the proof of steps (iv) and (v). 
Step(iv): 

In this step we justify the independence of P and F, and hence of P and fx. This 
independence is suggested by the likelihood factorization ([3]). Theorem [2] shows that 
this is in fact the case. First, observe that it follows from (J2]) that P = — is the NPMLE 
of P. We have 

(io) n s = V~s{P-p) 4jv(o,p(i-p)) . 
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Theorem 2. Let R x Dq[0, t] be endowed with the topology induced by 

|| (a,x) || = | a | +sup ae[(M] | x(s) \ 
Under the assumptions of Theorem 1 

(U S ,U N ) 4 (W,U) in RxD [0,t], 
where U is given in Theorem 1 and W ~ iV(0, P(l — P)) independent of U. 

Proof. It follows from Theorem 7.2.1 and Lemma 7.2.1 of Csdrgo and Revesz (1981) that 

|| (n s , Z7jv) — (n s , t/[ s p]) || — >0 as s oo 

It remains to show that U s and Uup\ are independent. This follows from the fact that 
P is a partial ancillary for Flb-, while {(l^ 6u,d , "Kj 065 , i = 1, 2, • ■ • , [sP]} is partially 
sufficient for Flb- D 

Step(v): 

In this final step we combine the results of steps (i) through (iii), in Theorem [TJ to yield 
the asymptotic behaviour of A. 

Proof of Theorem d Part (a) follows from Lemma (TJ part (b) of Theorem 1 of 
Asgharian et al. (2002), and the asymptotic properties of the sample proportion in Bi- 
nomial sampling. Part (b) follows from part (c) of Theorem 1 of Asgharian et al. (2002), 
(TTOl), and the identity (HI). □ 
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